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The no-cloning theorem [l[ , as sharpened in 0] , states 
that there is no way of blindly copying a pair of 
nonorthogonal pure states. More precisely, for any pair 
of nonorthogonal pure states pi,i 6 {1,2}, there is no 
trace-preserving completely positive map £ such that 
Vi,£(pi) — Pi ® Pi- In a classical context, with proba- 
bility distributions replacing density operators, universal 
cloning of pure states is possible (though in the infinite- 
dimensional case, this may be an unphysical idealization 
@, 13)' but even here, universal cloning producing inde- 
pendent copies is impossible if we require mixed states 
to be cloned also. A set of mixed states that can be 
cloned, in classical or quantum mechanics, must be mu- 
tually orthogonal — their density matrices or probability 
distributions must have nonoverlapping support [j| . 

Broadcasting is a weaker notion than cloning. A map 
£ that takes states on H to states on Ha <& broad- 
casts a state p if Tr^ (£(p)) = Tr# (£ (p)) = p, i.e. we 
do not require that the final state is a product state. A 
set of states is broadcastable if and only if there is a map 
£ that broadcasts each state in the set [25j| . This gen- 
eralizes pure-state cloning to mixed states in a way that 
picks out "classical" sets of states better than requiring 
independent copies does: the no-broadcasting theorem 
5] says not only that universal broadcasting is impossi- 
ble, but also that a set of states is broadcastable if and 
only if they commute pairwise. For the analogous classi- 
cal notions, universal broadcasting can be achieved (with 
minor caveats about the infinite dimensional case [J]). 

Cloning and broadcasting are two of the most elemen- 
tary information-theoretic tasks, and the impossibility of 
universal cloning and broadcasting are two of the more 
significant ways in which quantum theory differs from 
classical theory. In order better to understand these dif- 
ferences, and how they may underpin the better-than- 
classical performance of quantum information processing, 
it is helpful to consider the information-theoretic proper- 
ties of probabilistic theories that are neither classical nor 
quantum. This has lately been a very active area of in- 
vestigation in quantum information theory @, H> & EJ • 

Therefore, it is natural to ask whether the no- 



broadcasting theorem is a special feature of quantum the- 
ory, i.e. something that identifies it uniquely from a space 
of surrounding alternative theories, or generic, so that the 
possibility of universal broadcasting is special to classical 
theories. We show that the latter is the case. Working in 
a very general framework that assumes little apart from 
the convexity and finite dimension of state spaces, and 
the no-signaling principle for bipartite systems, we show 
that a set of states is broadcastable if and only if it is con- 
tained in a simplex generated by states that are jointly 
distinguishable via a single-shot measurement. This re- 
duces to the standard no-broadcasting theorem in the 
quantum case, and implies that universal broadcasting 
is only possible in classical theories. The resulting proof 
of the quantum no-broadcasting theorem is significantly 
simpler than the original proof of [B[ , and more intuitive 
and self-contained than that based on Lindblad's theo- 
rem [ll| (although the latter provided useful ideas). 

An operational framework. Before proving the main 
theorem, we describe a mathematical formalism involv- 
ing only minimal constraints that accommodates a wide 
range of possible probabilistic theories. Our approach is 
based on convex sets, and has a long pedigree a 12, 13j |. 
although the investigation of information processing in 
this context is more recent (see [3] and in particular , 
where a no-cloning theorem is derived for all nonclassical 
theories) . The framework is broad enough to include the- 
ories with "Popescu-Rohrlich" or "nonlocal boxes" 0, @] , 
exhibiting stronger-than-quantum correlations. 

The basic approach is operational, meaning that no- 
tions such as system, state, and measurement are among 
its fundamental concepts. We associate with a given type 
of system a set Q of possible states. If it is possible to 
prepare a system in a state u>i or u>2, then it should be 
possible to prepare any probabilistic mixture of the two 
(say by tossing a biased coin, preparing one or the other 
according to the outcome, and then forgetting the out- 
come) so we assume that f2 is convex. 

Whatever else a state is, it should define probabilities 
for measurement outcomes. We write the probability of 
getting outcome e when the state is u) as e(u>). Sup- 
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pose that to — pui\ + (1 — p) L02 and that this mixed 
state is prepared by tossing a biased coin, as above. 
The probability of outcome e should be the weighted 
average of the probabilities of e given lo\ and u>2, i.e., 
e(ui) = pe(wi) + (1 — p)e(u>2). Therefore we identify 
e with an afhne linear functional Q — » [0,1]. We refer 
to any such functional as an effect. The wmi effect u 
is defined by u(lo) = 1 for all lo £ Q, and represents a 
measurement outcome that is certain to occur no matter 
what the state. The set of all effects is denoted [0,u]. 
A measurement corresponds to a set of effects {e^} such 
that J2i e i — u ! that is J2i e i{< jJ ) = 1 f° r all w S Q. 

In quantum theory, Q is the set of density operators on 
some Hilbert space. A particular measurement outcome 
is associated with a positive operator E bounded by 
and the identity /, such that the probability of getting 
this outcome for state p is given by Tr(E p). This does 
define an affine linear functional on the state space since 
if p = ppi + (1 -p)p 2 , then Tr(E p) = pTr(E p x ) + 
(1 — p) Tr(E p 2 ). The unit effect corresponds to / and a 
measurement as a whole corresponds to a set of positive 
operators summing to /, i.e. a discrete POVM. 

A transformation in quantum theory corresponds to 
a linear trace-preserving completely positive map taking 
states on a Hilbert space Ti. into states on a Hilbert space 
H' [26| . As with the rule for measurement outcomes, lin- 
earity ensures that transformations respect probabilistic 
mixtures of states. In the generalized framework, a trans- 
formation corresponds to an affine mapping T : Q — > Q' , 
where Q is the state space of the system prior to the 
transformation, and Q' is the post-transformation state 
space. One should not assume that all such affine maps 
correspond to allowed transformations in a particular 
theory. For example, in quantum theory only completely 
positive maps (not arbitrary positive maps) do. 

The set of all affine functionals Q — » R is a vector space 
denoted A(Q). There is a natural embedding of Q in 
A(Q)* (the dual space of A(Q)), given by to i— > to, where 
Lu(a) = a(co) for all a S A(Q). This enables us to identify 
to with to, writing either to(a) or a(u) as convenient. Let 
V(Q) be the linear span of Q in A(Q)* . Then, Q is finite- 
dimensional iff V(Q) is finite-dimensional. We assume 
state spaces are finite-dimensional and compact, which 
guarantees that Q is the closed convex hull of its extreme 
points (referred to as pure states) . 

A d-dimensional system is classical iff Q is the convex 
hull of d+ 1 linearly independent pure states (a simplex), 
in which case Q can be thought of as the set of proba- 
bility distributions over d + 1 distinct possibilities. Only 
in such systems can the extremal points be perfectly dis- 
tinguished from each other by a single measurement, a 
point discussed in the proof of Theorem 2 below. Classi- 
cal systems are also characterized by the fact that each 
state has a unique decomposition into extremal states. A 
theory is classical iff each system in the theory is classical. 

Joint systems. Suppose systems A and B have state 



spaces Qa and Qb- The joint system AB will have its 
own state space, Qab, but how are Qa, Qb and Qab re- 
lated? Assume: (i) a joint state defines a joint probability 
for each pair of effects (e^es), where E A(Qa) and 
cb 6 A(Qb) [271) : (ii) these joint probabilities respect the 
no-signaling principle, i.e., the marginal probabilities for 
the outcomes of a measurement on B do not depend on 
which measurement was performed on A and vice versa; 
(iii) if the joint probabilities for all pairs of effects (e^, es) 
are specified, then the joint state is specified. 

These assumptions do not determine Qab uniquely in 
general, but they do imply [H, [HI, HI, 3] that it must be 
a convex set whose span can be identified with the vector 
space V(Qa) ® V(CIb)- Further, it must lie between two 
extremes, the maximal and the minimal tensor product. 
The maximal tensor product, £Ia ®max ^b, is the set of 
all bilinear functionals (j> : ^(Sl^) x j4(£!b) — > R such 
that (i) </>(e, /) > for all pairs of effects (e, /), and (ii) 
4>{uatUb) = 1, where ua and ub are the unit effects for 
systems A and B. The maximal tensor product has an 
important operational characterization: it is the largest 
set of states in (A{VLa) ®A(Q,b))* assigning probabilities 
to all product measurements but not allowing signaling 
13L ljj . The minimal tensor product, Q.a ®min ^b , is the 
convex hull of product states, where a product uja ® 
is defined by (uja <S) wb)(«, b) = ujA{a)ujB(b) for all pairs 
(a, b) e A(Q A ) x A(Q B )- These appeared in [13] in the 
context of abstract compact convex sets. 

Joint states in f2 a ®min ^ b are separable and those in 

®max &b but not in Qa ®min ^b are entangled. 
A particular theory should specify, besides £1a and 
flB, a set of joint states Qab such that Qa ®min Qb Q 
Qab C Q a ®max Qb- We call this Qa <8> Qb, keep- 
ing in mind that this may be any convex set bounded 
by the minimal and maximal tensor products. If cither 
A or B is classical then Qa ®min Qb — Qa ®max Qb 
and there is no entanglement. In particular, if both are 
classical then both Qa ®min Qb and Qa ®max Qb are 
the simplex whose vertices are ordered pairs of an ex- 
tremal point of Qa and one of Qb ■ For quantum theory 
Qa ®m.in Q r C Qar C Q a ® m.nx Q r , where the inclusions 
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This treatment is easy to generalize to multipartite sys- 
tems, by allowing A and B themselves to be composite. 
Also, we can define unambiguously the notion of a re- 
duced (or marginal) state. Any state u>ab G QA®maxQB 
has reduced states to a and ujb defined such that lua(ci) = 
uiab(cl,ub) and uisib) = LUAB(uA,b). It is easy to show 
that if either reduced state is pure, then ujab = u>a®^b- 
Cloning and Broadcasting. We can generalize the def- 
initions of cloning and broadcasting given at the begin- 
ning. Consider a state space Q and a transformation 
T : Q — > Q ® Q. Denote the reduced states of T(uj) 
by (T(uj))a and (T(uj))b- We say T clones a state lo iff 
T(lo) = lo®lo. A set of states is cloneable iff there is a sin- 
gle map T such that T clones each state in the set. We say 
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T broadcasts a state lu iff (T(lo))a = (T(lo))b = And 
a set of states is broadcastable iff there is a single map T 
such that T broadcasts each state in the set. In addition, 
we say a set of states {u>%, . . . , Lu n } is jointly distinguish- 
able iff they can be distinguished with certainty with a 
single-shot measurement, i.e., there exists a measurement 
E with outcomes ei, . . . , e„ such that Wj(ej) = 

Theorem 1. For any finite dimensional state space SI 
and any choice of tensor product SI <g) S7, a set of states is 
cloneable iff it is jointly distinguishable. 

A rigorous proof of this theorem is given in [201 ] . Intu- 
itively, if a set of states is cloneable, then they may be 
distinguished by making many clones and then identify- 
ing the state with suitable measurements on the copies. 
Conversely, if the states are jointly distinguishable, then 
one way of cloning is to perform the measurement that 
distinguishes them and then to prepare two copies. 

Theorem 2. Universal cloning is only possible for clas- 
sical systems. 

Proof. Universal cloning implies that the set of all 
pure states is clonable, so any finite subset {u>i, . . . , u) n } 
is clonable. From Theorem 1, it follows that this set is 
jointly distinguishable, thus we can find affinc functionals 
ei, . . . , e„ such that u>i(ej) = 5ij. It follows that the u>i 
are linearly independent in V(f2). Since this holds for 
any finite subset of pure states, all pure states are linearly 
independent, and since V(f2) is finite dimensional there 
can only be a finite number of such states. Thus SI is a 
simplex and the system is classical. □ 

For any kind of system, broadcasting of pure states 
reduces to cloning because if u is pure and T(u>) has 
reduced states equal to to, then T(u>) = lu ® lu. So Theo- 
rem 2 implies that universal broadcasting is possible only 
for classical systems. Our main theorem goes further: it 
specifies exactly when a set of states is broadcastable. 

Theorem 3. A set of states is broadcastable iff it lies 
in a simplex generated by jointly distinguishable states. 

Proof. For the "if" direction, it is easily verified that 
the map that clones the extreme points of the simplex 
(cf. the discussion following Theorem 1) broadcasts the 
entire simplex. For "only if", consider B' : SI — > S7 ® S7, 
and denote by T 1 the set of states broadcast by B' . It 
is immediate from the definition of broadcasting that if 
B' broadcasts both io\ and u>2, then B' broadcasts any 
convex combination of them. Thus T' is convex. Let 
cr : f7 il — > rZ ® il be the swap operation, defined by 
a(u>A ® wb) — lob <8> ijJa- Define the symmetrized map 
B : n -> S7 ® SI by B = (B' + a o B')/2. Denote by T 
the set of states broadcast by B and note that fCT. 
The strategy will be to define, in terms of B, a map 
Q : T — > T® max T such that Q is universally broadcasting 
on r, hence cloning for all states extremal in Y. It is 
critical that Q's domain be T, rather than SI, because 
the extremal points of Y are not necessarily extremal in 
SI, and we need to use Theorem 1 to conclude that T's 
extremal points are cloned. 



Definition. A compression of onto a subset T is an 
idempotent affine mapping SI — > Q having range T. 

Lemma 1. Let T : Q — > 57 be any transformation 
taking SI into itself. Then there exists a compression of 
S7 onto the set of fixed points of T. 

Proof. For each n G N, let P n = i J2k=i Tk '■ 
Cl — ► 0. Since SI is compact, we may assume (pass- 
ing to a subsequence if necessary) that (P n ) converges 
to a limiting affine map P : SI — > fi. If T(ui) = lu, 
then clearly P(ui) — ui; conversely, if cj = P(</>) for 
some G n, then T(w) = li m?woo \ ^Li T k+1 (4>) = 
limbec i Y.ltl T k {4>) - lim„^ co ^T(cj>). We note that 
linin^oo —T((f>) = 0, and rewrite the first term as 
lim^ I ELi r fc (0)+hm„^oo ^T"+ 1 (</') = P(</>) = w. 
Thus, the range of P is exactly the fixed-point set of T, 
as claimed. Therefore, as P(4>) is a fixed point of T, we 
have P(P(<fi)) = P(4>) for any 0, i.e., P is idempotent. □ 

Continuing with the proof of Theorem 3, recall the 
symmetrized map B, which broadcasts states in T, and 
define a map Ba so that Ba(co) is given by the reduced 
state (B(lu))a- Note that oj € T iff to is a fixed point 
of Ba- By Lemma 1, there is a compression P onto T. 
Consider P as a map SI — ► T; there is a unique definition 
of P <g> P : SI ® m ax H — > T <8 ma a: T satisfying (P ® P) (w^ ® 
w B ) = P(cja) ® P(ws). Now define Q : T -> T ® TOa:c T 
by <9(t) = (P ® P)(B(j))- Q is universally broadcasting 
on T. For if 7 G T, we have, for all er G [0, Mr], 

QAh)(er) = Q(l)(er ® »r) = ((^ ® P)P(7))(e r 8> «r) 
= P( 7 )(P*e r (8 P*u r ) = BA^^er) = 7 (P*e r ) 
= P(7)(er) = 7(er), 

where P*e := e o P for arbitrary e, and the last step 
uses the fact that P( 7 ) = 7, since 7 G T. (Here [0, ur] 
is defined analogously to [0,u], but with T playing the 
role played by SI.) It follows that Qa{i) = 7; similarly, 
Qb(j) = 7- Since Q is universally broadcasting on T, it 
broadcasts T's extremal states. Broadcasting reduces to 
cloning for extremal states, so Q is universally cloning on 
the set of extremal points of T. It follows from Theorem I 
that the extreme points of T are jointly distinguishable 
in r, i.e. via an observable consisting of effects in [0, Mr]- 
We can "lift" effects er G [0, ur], defined only on the span 
V(r) of r, to functionals on V(Cl) via the map er 1— » 
er o P, which takes effects in [0,ur] to ones in [0,u]. 
Thus any observable on T lifts via P to one on SI. This 
includes the one that distinguishes the extreme points of 
r, so Theorem 3 follows. □ 

The proof uses the symmetrized P, rather than P', 
because the set T of states broadcast by B is the fixed- 
point set of P's marginal maps Ba = Bb, so Lemma I 
provides a compression P onto T. The set T' broadcast 
by B' is the intersection of the fixed-point sets of the 
marginal maps B' A and B' B which need not be equal, so 
Lemma 1 does not provide a compression onto it. 

Corollary (quantum no-broadcasting theorem). Let Y 
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be a set of density operators on a Hilbert space H. If 
there is a positive map T : 13(H) —* 13(H) broadcasting 
each pgr then the operators in T mutually commute. 

Proof. By Theorem 3, T is contained in a simplex 
generated by distinguishable, hence commuting, density 
operators. Hence the operators in T also commute. □ 

This result is stronger than that in [5j, which applied 
to completely positive maps rather than all positive maps. 

Theorem 3 tells us little about the convex structure of 
the set r of states broadcast by a map B. But Ref. 
builds on it to show that any such r is a simplex gener- 
ated by jointly distinguishable states. 

Conclusion. In order to understand the nature of in- 
formation processing in quantum mechanics, it is useful 
to demarcate those phenomena that are essentially quan- 
tum, from those that are more generically non-classical. 
This Letter has identified an important feature of quan- 
tum information that is generic: the no-broadcasting the- 
orem. Note that not every qualitative result of quantum 
information is similarly generic, e.g. teleportation is not 
possible in every probabilistic theory [1, Hl| . 



In [22| it was shown that the conjunction of no- 



signaling, no-broadcasting and no-bit-commitment im- 
plies the existence of noncommuting observables and en- 
tangled states for theories in a C* -algebraic framework, 
yielding theories quite close to quantum theory. However, 
this framework is already close to quantum theory, since 
all theories in it have Hilbert space representations and 
the finite-dimensional ones are just quantum theory, clas- 
sical probability and quantum theory with superselection 
rules. The framework adopted in this Letter is more nat- 
ural for pursuing the program of deriving quantum theory 
from information theoretic axioms [22LI23T]. as it is narrow 
enough to allow axioms to be succinctly expressed math- 
ematically, but broad enough that the main substantive 
assumptions will be contained in the axioms rather than 
in the framework itself. The framework assumes no- 
signaling, and we have shown that no-broadcasting holds 
for any nonclassical model within it. Such models can be 
very different from quantum theory, e.g. they may sup- 
port stronger-than-quantum correlations Q. An open 
question is whether no-bit-commitment is also generic, 
but in any case it seems unlikely that these three ax- 
ioms alone would get one particularly close to quantum 
theory. Thus our results suggest that future progress in 
characterizing quantum theory in terms of information- 
theoretic tasks is likely to require assumptions of a less 
generic character, such as the possibility of teleportation. 
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